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Abstract 

The high repair cost of {n, k) Maximum Distance Separable (MDS) erasure codes has recently 
Q , motivated a new class of codes, called Regenerating Codes, that optimally trade off storage cost for repair 

bandwidth. In this paper, we address bandwidth-optimal {n, k, d) Exact-Repair MDS codes, which allow 
^ ■ for any failed node to be repaired exactly with access to arbitrary d survivor nodes, where k < d < n — 1. 

m ■ 

\^ ' We show the existence of Exact-Repair MDS codes that achieve minimum repair bandwidth (matching 



X 



the cutset lower bound) for arbitrary admissible {n, k, d), i.e., k < n and k < d < n — 1. Our approach 



i is based on interference alignment techniques and uses vector hnear codes which allow to split symbols 

■ into arbitrarily small subsymbols. 



Index Terms 



. Exact Repair Codes, MDS Codes, Interference Alignment 



I. Introduction 



In distributed storage systems, maximum distance separable (MDS) erasure codes are well- 
known coding schemes that can offer maximum reliability for a given storage overhead. For an 
{n, k) MDS code for storage, a source file of size M bits is divided equally into k units (of 
size ^ bits each), and these k data units are expanded into n encoded units, and stored at n 
nodes. The code guarantees that a user or Data Collector (DC) can reconstruct the source file by 
connecting to any arbitrary k nodes. In other words, any (n — k) node failures can be tolerated 
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with a minimum storage cost of ^ at each of n nodes. While MDS codes are optimal in terms of 
reliability versus storage overhead, they come with a significant maintenance overhead when it 
comes to repairing failed encoded nodes to restore the MDS system-wide property. Specifically, 
consider failure of a single encoded node and the cost needed to restore this node. It can be 
shown that this repair incurs an aggregate cost of M. bits of information from k nodes. Since 
each encoded unit contains only ^ bits of information, this represents a fc-fold inefficiency with 
respect to the repair bandwidth. 

This challenge has motivated a new class of coding schemes, called Regenerating Codes 
[III, 0, which target the information-theoretic optimal tradeoff between storage cost and repair 
bandwidth. On one end of this spectrum of Regenerating Codes are Minimum Storage Regen- 
erating (MSR) repair codes that can match the minimum storage cost of MDS codes while also 
significantly reducing repair bandwidth. As shown in [jT||, [|2l, the fundamental tradeoff between 
bandwidth and storage depends on the number of nodes that are connected to repair a failed 
node, simply called the degee d where k < d < n — \. The optimal tradeoff is characterized by 



where a and 7 denote the optimal storage cost and repair bandwidth, respectively for repairing 
a single failed node, while retaining the MDS-code property for the user. Note that this code 
requires the same minimal storage cost (of size ^) as that of conventional MDS codes, while 



there is a 5x bandwidth reduction). MSR (n, k, d) repair codes can be considered as Repair 
MDS codes that (a) have an (n, A;) MDS-code property; and (6) can repair single-node failures 
with minimum repair bandwidth given a repair-degree of d. Throughout this paper, we will use 
Repair MDS codes to indicate MSR repair codes. 

While Repair MDS codes enjoy substantial benefits over conventional MDS codes, they come 
with some limitations in construction. Specifically, the achievable schemes in |l2l that meet 
the optimal tradeoff bound of ([B restore failed nodes in a functional manner only, using a 
random-network-coding based framework. This means that the replacement nodes maintain the 
MDS-code property (that any k out of n nodes can allow for the data to be reconstructed) but 
do not exactly replicate the information content of the failed nodes. 

Mere functional repair can be limiting. First, in many applications of interest, there is a need to 




(1) 



substantially reducing repair bandwidth by a factor of ^'('^ (e.g., for (n, k, d) = (31, 6, 30), 



April 28, 2010 



DRAFT 



3 



maintain the code in systematic form, i.e., where the user data in the form of k information units 
are exactly stored at k nodes and parity information (mixtures of k information units) are stored 
at the remaining [n—k) nodes. Secondly, under functional repair, additional overhead information 
needs to be exchanged for continually updating repairing-and-decoding rules whenever a failure 
occurs. This can significantly increase system overhead. A third problem is that the random- 
network-coding based solution of [HI can require a huge finite-field size, which can significantly 
increase the computational complexity of encoding- and-decodingl|. Lastly, functional repair is 
undesirable in storage security applications in the face of eavesdroppers. In this case, information 
leakage occurs continually due to the dynamics of repairing-and-decoding rules that can be 
potentially observed by eavesdroppers [|3l. 

These drawbacks motivate the need for exact repair of failed nodes. This leads to the following 
question: is there a price for attaining the optimal tradeoff of © with the extra constraint of 
exact repair? The work in H considers partial exact repair (where only systematic nodes are 
repaired exactly), while the work in [|5 1 considers exact repair of all nodes, giving a clear answer 
with deterministic scalar linear code^ having small alphabet size for the case of ^ < | (and 
d > 2k — 1): it was shown that for this regime, there is no price even with the extra constraint of 
exact repair. What about for either ^ > ^ or k < d < 2k — 11 The work in [4] sheds some light 
on this case: specifically, it was shown that under scalar linear codes, when either ^ > | + ^ or 
k + 1 < d < max{k + 1,2k — A), there is a price for exact repair. What if non-linear or vector 
linear codes are used? The tightness of the optimal tradeoff of ([T]) under these assumptions has 
remained open. In this paper, we show that using vector linear codes, the optimal tradeoff of ([T]) 
can be indeed attained /or all admissible values of {n, k, d), i.e., k < n and k < d < n — 1. That 
is if we are willing to deal with arbitrarily small subsymbols, then Exact-Repair MDS codes can 
come with no loss of optimality over functional-repair MDS codes. Note that we will use this 
definition of admissibility throughout the paper. 

Our achievable scheme builds on the concept of interference alignment, which was introduced 

'in d, Dimakis-Godfrey-Wu-Wainwright-Ramchandran translated the regenerating-codes problem into a multicast communi- 
cation problem where random-network-coding-based schemes require a huge field size especially for large networks. In storage 
problems, the field size issue is further aggravated by the need to support a dynamically expanding network size due to the need 
for continual repair. 

^In scalar linear codes, symbols are not allowed to be split into arbitrarily small subsymbols as with vector linear codes. 
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in the context of wireless communication networks |l6l, In particular, the interference align- 
ment scheme in [7] that permits an arbitrarily large number of symbol extensions (i.e., vector 
linear codes) forms the basis of our results here. The results in [4] say that under scalar linear 
codes, the case of either ^ > or k+1 < d < max(A;+l, 2k— A) induces more constraints than 
the available number of design variables. This parallels the problem encountered by Cadambe and 
Jafar in [|7]| in the conceptually similar but physically different context of wireless interference 
channels. Cadambe and Jafar resolve this issue in |I71 using the idea of symbol-extension, which is 
analogous to the idea of vector linear codes for the distributed storage repair problem studied here. 
Building on the connection described in {5\ between the wireless interference and the distributed 
storage repair problems, we leverage the scheme introduced in |I71 to the repair problem, showing 
the existence of Exact-Repair MDS codes that achieve minimum repair bandwidth (matching 
the cutset lower bound) for all admissible values of (n, k, d). 

II. Interference Alignment for Exact-Repair MDS Codes 

Linear network coding |[8l, ^ (that allows multiple messages to be linearly combined at 
network nodes) has been established recently as a useful tool for addressing interference issues 
even in wireline networks where all the communication links are orthogonal and non-interfering. 
This attribute was first observed in [lOJ, where it was shown that interference alignment could 
be exploited for storage networks, specifically for Exact-Repair MDS codes having small k 
(k = 2). However, generalizing interference alignment to large values of k (even A; = 3) proves 
to be challenging, as we describe in the sequel. In order to appreciate this better, let us first 
review the scheme of [fTOl that was applied to the exact repair problem. We will then address 
the difficulty of extending interference alignment for larger systems and describe how to address 
this in Section [nil 

A. Review of (4, 2) Exact-Repair MDS Codes / fTOl/ 

Fig.IHillustrates an interference alignment scheme for a (4, 2) Exact-Repair MDS code defined 
over GF(5). First one can easily check the MDS property of the code, i.e., all the source files 
can be reconstructed from any k{= 2) nodes out of n(= 4) nodes. As an illustration, let us see 
how failed node 1 (storing (01,02)) can be exactly repaired. We assume that the degree d is 3, 
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Fig. 1. Interference alignment for a (4, 2) Exact-Repair MDS code defined over GF(5) 1101 . Designing appropriate projection 
vectors, we can align interference space of (61,62) into one-dimensional linear space spanned by [1, 1]*. As a result, we can 
successfully decode 2 desired unknowns (01,02) from 3 equations containing 4 unknowns (01,02,61,62). 

and a source file size M is 4. The cutset bound ([B then gives the fundamental limits of: storage 
cost a = 2; and repair-bandwidth-per-link f3 := ^ = 1. 

The example illustrated in Fig. \T\ shows that the parameter set described above is achievable 
using interference alignment. Here is a summary of the scheme. Recall that the bandwidth- 
per-link is /3 = 1 and we use a scalar linear code, i.e., each symbol has unit capacity and 
cannot be split into arbitrarily small subsymbols. Hence, each survivor node uses a projection 
vector to project its data into a scalar. Choosing appropriate projection vectors, we get the 
equations as shown in Fig. [TJ {bi + 62); oi + 2a2 + (61 + ^2); 2ai + 02 + (^1 + ^2)- Observe that 
the undesired signals (^1,62) (interference) are aligned onto an 1-dimensional linear subspace, 
thereby achieving interference alignment. Therefore, we can successfully decode (01,02) with 
three equations although there are four unknowns. Similarly, we can repair (61,62) when it has 
failed. 

For parity node repair, a remapping technique is introduced. The idea is to define parity node 
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^gWailed 



node 2 



node 3 



{parity node 1) 2^2 



node 4 
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How to repair? 
ai 



02 



Interference alignment 
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Fig. 2. Geometric interpretation of interference alignment. The blue solid-line and red dashed-line vectors indicate linear 
subspaces with respect to "a" and "b", respectively. The choice of Vq2 ~ Bj^^v^i and Vca = B^^Vd enables interference 
alignment. For the specific example of Fig. [T] the corresponding encoding matrices are Ai = [1,0; 0,2], Bi = [1,0; 0,1]. 
Aa = [2,0;0, 1], Ba = [1,0; 0,1]. 



symbols with new variables as follows: 

Node 3: a[ := ai + bi; a'2 := 2a2 + 62; 
Node 4: h\ := 2ai + 61; b'^ := 02 + 62- 

We can then rewrite (ai, 02) and (61, 62) with respect to {a[, a'2) and {b\, b^)- In terms of prime 
notation, parity nodes turn into systematic nodes and vice versa. With this remapping, one can 
easily design projection vectors for exact repair of parity nodes. 

B. Geometric Interpretation 

Using matrix notation, we provide geometric interpretation of interference alignment for the 
same example in Fig. [T] Let a = (ai, 02)* and b = (61, ^2)* be 2-dimensional information- unit 
vectors, where (■)* indicates a transpose. Let Aj and Bj be 2-by-2 encoding submatrices for 
parity node i (z = 1, 2). Finally we define 2-dimensional projection vectors v^i's {i = 1, 2, 3). 

Let us consider exact repair of systematic node 1. By connecting to three nodes, we get: 
b*VQ,i; a*(AiVQ,2) + b*(BiVQ,2); a*(A2VQ,3) + b*(B2VQ,3). Recall the goal of decoding 2 desired 
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unknowns out of 3 equations including 4 unknowns. To achieve this goal, we need: 



rank 





(AiV„2)* 


j = 2; rank 


( 






(A2V,3)* 





VL 



'al 



(BiV,2)* 
(B2V„3)* 



/ 



1. 



(2) 



The second condition can be met by setting Vq,2 = B^^Vd and Vq,3 = This choice 

forces the interference space to be collapsed into a one-dimensional linear subspace, thereby 
achieving interference alignment. With this setting, the first condition now becomes 

rank ( [AiB^^v^ A2B2- ) = 2. (3) 

It can be easily verified that the choice of A/s and Bj's given in Figs. [T] and [2] guarantees the 
above condition. When the node 2 fails, we get a similar condition: 

rank ( [Bi A^'v^i B2 A2- V^^i] ) = 2, (4) 

where v^j's denote projection vectors for node 2 repair. This condition also holds under the given 
choice of encoding matrices. With this remapping, one can easily design projection vectors for 
exact repair of parity nodes. 



C. Connection with Interference Channels in Communication Problems 
Observe the three equations shown in Fig. |2l 








V* 


(AiV,2)* 


a + 


(BiV,2)* 


(A2V,3)* 




(B2V„3)* 



desired signals interference 

Separating into two parts, we can view this problem as a wireless communication problem, 
wherein a subset of the information is desired to be decoded in the presence of interference. 
Note that for each term (e.g., AiVq,2), the matrix Ai and vector Vq,2 correspond to channel 
matrix and transmission vector in wireless communication problems, respectively. 

There are, however, significant differences. In the wireless communication problem, the chan- 
nel matrices are provided by nature and therefore not controllable. The transmission strategy 
alone (vector variables) can be controlled for achieving interference alignment. On the other 
hand, in our storage repair problems, both matrices and vectors are controllable, i.e., projection 



April 28, 2010 



DRAFT 



8 



vectors and encoding matrices can be arbitrarily designed, resulting in more flexibility. However, 
our storage repair problem comes with unparalleled challenges due to the MDS requirement and 
the multiple failure configurations. These induce multiple interference alignment constraints that 
need to be simultaneously satisfied. What makes this difficult is that the encoding matrices, 
once designed, must be the same for all repair configurations. This is particularly acute for large 
values of k (even k = 3), as the number of possible failure configurations increases with n 
(which increases with k). 

III. A Proposed Framework for Exact-Repair MDS Codes 

We propose a conceptual framework based on vector linear codes to address the exact repair 
problem. As described earlier, this framework is based on that of interference alignment for 
wireless channels in Q. We leverage the connection between the two problems to develop Exact- 
Repair MDS codes that are optimal in repair bandwidth for all admissible values of (n, k, d). 

Our framework consists of four components: (1) developing a code structure for exact repair 
of systematic nodes based on the vector linear codes; (2) drawing a dual structure between 
the systematic and parity node repair; (3) guaranteeing the MDS-code property; (4) providing a 
probabilistic guarantee of the existence of the code for a large enough alphabet size. In particular, 
the diagonal structure of single-antenna wireless channels (exploited in [7 J) forms the basis of 
the structure of encoding submatrices of our codes. The framework covers all admissible values 
of (n, k, d). This contrasts the scalar- linear code based framework in [[5]| which covers the case 
of ^ < I and d>2k — 1, but which provides deterministic codes with small alphabet size and 
guaranteed zero error. Furthermore, addressing different code parameters in the case of ^ < i 
and d > 2k — 1 requires specific attention, such as the design of puncturing codes introduced 
in flU. See BH for details. In contrast, here we target only the existence of exact-repair codes 
without specifying constructions. This allows for a simpler characterization of the solution space 
for the entire range of admissible repair code parameters. In order to convey the concepts in 
a clear and concise manner, we first focus on the simplest example which does not belong to 
the framework in 10: (6, 3,4) Exact-Repair MDS codes. This example is a representative of the 
general case of A; < tt, and k < d < n — 1, with the generalization following in a straightforward 
way from this example. This will be discussed in Section ITVl 
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Fig. 3. Difficulty of achieving interference alignment simultaneously when using scalar linear codes 



A. Systematic Node Repair 

For k >3 (more-than-two interfering information units), achieving interference alignment for 
exact repair turns out to be significantly more complex than the k = 2 case. Fig. [3] illustrates 
this difficulty through the example of repairing node 1 for a (6,3,4) code. In accordance with 
the (4, 2) code example in Figs. \T\ and [2l we choose = 6 so that repair-bandwidth-per-link 
has unit capacity (/3 := ^ = 1). By the optimal tradeoff of ©, this gives a = 2. Suppose that 
we use scalar linear codes, i.e., each symbol has unit capacity and cannot be chopped up into 
arbitrarily smaller chunks. We define a = (01,02)*, b = (61, ^2)* and c = (ci,C2)*. We define 
2-by-2 encoding submatrices of Aj, Bj and Cj (for i = 1,2,3); and 2-dimensional projection 
vectors v^i's. 

Suppose that survivor nodes (2, 3, 4, 5) are connected for exact repair of node 1. We then get 
the 4 (= d) equations: 








V* 

al 









(AiV„3)* 


a + 




(BiV„3)* 


b + 


(CiV,3)* 


. (A2V„4)* _ 
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symbol 



Scalar linear code Vector linear code 




subsymbol 



Fig. 4. Illustration of tiie idea of vector linear codes through storage node 1 in the (6, 3, 4) code example. In scalar linear 
codes, symbols are not allowed to be split. On the other hand, vector linear codes allow to split symbols into arbitrarily small 
subsymbols. In this example, node 1 stores a — 2 symbols, each of which has unit capacity. In vector linear codes, this unit- 
capacity symbol can be split into subsymbols with arbitrarily small capacity. For example, we can split each symbol into B 
number of subsymbols, so each subsymbol has i capacity. 



In order to successfully recover the desired signal components of "a", the matrices associated 
with b and c should have rank 1, respectively, while the matrix associated with a should have full 
rank of 3. In accordance with the (4, 2) code example in Fig. |2l if one were to set Vq,3 = B^^ 
and Vq,4 = B^^Vai, then it is possible to achieve interference alignment with respect to b. 
However, this choice also specifies the interference space of c. If the Bj's and C/s are not 
designed judiciously, interference alignment is not guaranteed for c. Hence, it is not evident 
how to achieve interference alignment at the same time. 

In order to address the challenge of simultaneous interference alignment, we invoke the idea of 
symbol extension introduced in [7J, which is equivalent to the concept of vector linear codes in 
the storage repair problem. Fig. |4] illustrates the idea of vector linear codes through storage node 
1 in the (6,3,4) code example. While scalar linear codes do not allow symbol splitting, vector 
linear codes permit the splitting of symbols into arbitrarily small subsymbols. In this example, 
each node stores a = 2 symbols, each of which has unit capacity. In vector linear codes, this 
unit-capacity symbol is allowed to be split into subsymbols with arbitrary small capacity. In 
this example, we split each symbol into B number of subsymbols, so each subsymbol has 
capacity. 
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a* As + b*B2 


+ C*C2 




a* A3 + b*B3 


+ C*C3 




V = {(B^^B^^q^q-) w : ei, 62, 63, 64 G {1, • • • , m}} 

V = {(B^^B^^C^^C^-) w : ei, 62, eg, 64 E {1, • • ■ , m + 1}} 



Fig. 5. Illustration of exact repair of systematic node 1 for (6, 3, 4) Exact-Repair MDS codes. We split each symbol into 
B = number of subsymbols, where m is an arbitrarily large positive integer and the exponent N is equal to 4 and is 
carefully chosen depending on code parameters, i.e., A'^ — {k — l){d — k + 1) =4. This corresponds to the total number of 
encoding submatrices involved in the connection except for those associated with desired signals. Note that each subsymbol has 
capacity. The maximum file size (based on the optimal tradeoff of (TJ) is = 6 units, inducing a storage cost a = 2 units. 
Hence, each storage contains 2m^ number of subsymbols and the size of encoding submatrices is 2m*-by-2m^. We consider 
diagonal encoding submatrices. A failed node is exactly repaired by having systematic and parity survivor nodes project their 
data onto linear subspaces spanned by column vectors of V := [vi,-- - ,V(„j^i)4] and V := [vi,--- ,v,„4], respectively. 
Here Vi G V and Vi G V. Notice that BiVi, B2Vi, CiVi, C2Vi G V,Vi = 1, • • ■ ,m^. Hence, the matrix associated with 
interference b has rank of at most (m +1)* instead of 2m'*. Similarly the matrix associated with interference c has rank of at 
most (m + 1)*. This enables simultaneous interference alignment as m ^ 00. On the other hand, rank[AiV, A2V] — 2m'* 
with probability 1, providing probabilistic guarantee of decodability of desired signals. Finally, notice that total repair bandwidth 
7 = 2 ^""^4 ^ — h 2 ■ 1 approaches the cutset lower bound of 4 units as m goes to infinity. Therefore, we can ensure exact repair 
of systematic node 1 with minimum repair bandwidth matching the cutset lower bound. 



This idea of vector linear codes is key to interference alignment for the storage repair problem. 
Fig. [5] illustrates exact repair of systematic node 1 for (6,3,4) Exact-Repair MDS codes. Using 
vector linear codes, we split each symbol into B = number of subsymbols, where m is 
an arbitrarily large positive integer and the exponent is carefully chosen depending on code 
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parameters. Specifically, 



A^= {k-l){d-k + l). 



(5) 



This choice of and the form of -B = are closely related to the scheme to be described in 
the sequel. In this example, = 4. The maximum file size (based on the cutset bound of ([1])) 
is = 6 units, inducing a storage cost a = 2 units. Since each subsymbol has ^ capacity, 
each storage contains am^{= 2m^) number of subsymbols, e.g., a* = (ci, ■ ■ ■ , a2m*), where 
indicates a subsymbol. Note that the size of encoding submatrices (Aj,Bj,Cj) is 2m'^-hy-2'm'^. 
We consider diagonal encoding submatrices. As pointed out in the diagonal matrix structure 
ensures commutativity and this property provides the key to the interference alignment scheme 
(to be described shortly): 

■■■ 



A,; 



ai,2 







[commutative property holds). 



(6) 



■ ■ ■ ai,2m4 

A failed node 1 is exactly repaired through the following steps. Suppose without loss of 
generality that survivor nodes (2,3,4,5) are used for exact repair of node 1, i.e., k — 1 = 2 
systematic nodes and d — k + 1 = 2 parity nodes. One can alternatively use 1 systematic node 
and 3 parity nodes for repair instead. This does not fundamentally alter the analysis, and will 
be covered in Remark [H in the next section. For the time being, assume the above configuration 
for the connection: k — 1 systematic nodes and d — k + 1 parity nodes. Parity survivor nodes 
project their data using the following projection matrix: 



V:= [vi,... ,v„4] GFf 
where Vj G V. The set V is defined as: 



(7) 



V := {(B^^B^^C? C^*) w : ei, es, eg, 64 G {1, ■ ■ ■ , m}} , (8) 
where w = [I,-- - ,1]*. Note that |V| < m^. The vector maps to a different sequence of 
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13 

(ei, 62, 63, 64). For example, we can map: 

Vi = B1B2C1C2W, 
V2 = B?B2CiC2W, 
V3 = B?B2CiC2W, 

TD m TD m /-I m /-I m — 1 „ , 
Vm4-1 = -tJi 132 '-^l '-^2 W, 

Vm4 = Bi B2 U2 W. 

Let us consider the equations downloaded from parity node 1 and 2 (node 4 and 5): 

a*(AiV) + b*(BiV) + c*(CiV); 

(10) 

a*(A2V) + b*(B2V) + c*(C2V). 

Note that by ([8]), any column vector in [BiV, B2V] or [CiV, C2V] is an element of V defined 
as: 

V := {(B^B^^Cf C^*) w : ei, 62, es, 64 e {1, ■ ■ ■ , m + 1}} . (11) 

This implies that rank[BiV, B2V] < (m + 1)^ and rankfCiV, C2V] < (m + 1)^ This allows 
for simultaneous interference alignment. Systematic survivor nodes project their data using the 
following projection matrix: 

V:=[vl,■■■,V(„+l)4]GFJ™'x(-+l)^ (12) 

where Vj G V. We also map Vj to a difference sequence of (61, 62, 63, 64) as in (|9l). We can then 
guarantee that: 

span[BiV,B2V] C span[V] 

(13) 

span[CiV,C2V] C span[V]. 
Hence, we can completely get rid of any interference. Now let us analyze the decodability of 
the desired signal vector. To successfully recover a, we need: 

rank[AiV, A2V] = 2m^ (14) 

Using standard arguments based on the technique in fT'l and Schwartz-Zippel lemma fill, 
we can ensure the condition of (fT4l) with probability 1 for a sufficiently large field size q. 
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Specifically, we randomly and uniformly choose each diagonal element (non-zero) of all of the 
encoding submatrices in Fg. We then compute the determinant of [AiV, A2V] by adapting the 
technique in [7|. Using Schwartz-Zippel lemma [11], we can then show that the probability that 
the polynomial of the determinant is identically zero goes to zero for a sufficiently large field 
size. The proof is tedious and therefore we omit details. See [|71, [fTTI for details. 
We now validate that total repair bandwidth is: 



7 = (fc-l) 



{m + iy 



m 



mr 



+ [d-k + l)- — 



2^ — + 2 ■ 1 



(15) 



— > 4 units. 

The first equality is because each subsymbol has capacity of and we use projection matrix 
V G Fg"* Y g ■^2m*xm* whcn connecting to systematic nodes and parity nodes, 

respectively. Note that as m goes to infinity, total repair bandwidth approaches minimum repair 
bandwidth matching the cutset lower bound of ([I]). 



B. Dual Relationship between Systematic and Parity Node Repair 

We will show that parity nodes can be repaired by drawing a dual relationship with systematic 
nodes. The procedure has two steps. The first is to remap parity nodes with a', b', and c', 
respectively: 

A* B* C* 







a' 




b' 




c' 





A* 

A* 
^3 



Bt r^t 
2 '-^2 



^3 



C 



a 
b 
c 



Systematic nodes can then be rewritten in terms of the prime notations: 

a* = a'*A; + b'*B; + c'*C;, 

b* = a'* a; + b'*B; + c'*C' 



-.t 



Si"A'^ + b'*B'3 + c"C^, 



(16) 



where the newly mapped encoding matrices (A-, B^, Cj)'s are defined as: 

-1 -1 

Ai A2 A3 



A[ A'2 A3 
B'l B^ 
C'l C2 C3 



Bi B2 B3 
Ci C2 C3 



(17) 
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a'*A'i 


f b"Bi + c'*Ci 






f h'% + c'*c;. 




ei'*A;, 4 


- b'*B;5 + c'*c;i 





Goal: rank = 2m^ rank < (m + 1)^ 



V = {(B'l-^^B'f C'l^^C;*) w : 61,62,63,64 e {!,■■■ ,m}} 
V' = { {B[''B'i'C[''C'^'') w : 61, 62, 63, 64 G {1, ■ • ■ , m + 1}} 




rank < (m + 1) 



Fig. 6. Illustration of exact repair of parity node 1 for (6,3,4) Exact-Repair MDS codes. Notice that A^'s, B^'s, and C^'s 
are also diagonal matrices, since these matrices are functions of diagonal matrices Ai's, B^'s, and Ci's. Survivor nodes 1 and 
2 project their data onto linear subspaces spanned by column vectors of V' ~ [v'l, ■ ■ ■ , v^4]. Here G V'. Notice that 
B'lVi, B2Vi, C'lVi, C'2Vi £ V,\/i — 1, ■ ■ ■ , m**. Hence, the matrix associated with interference b' has rank of at most (m+ 1)* 
instead of 2m*. Similarly the matrix associated with interference c' has rank of at most (m + l)**. This enables simultaneous 
interference alignment as m — !> oo. Survival nodes 5 and 6 project their data using V := [vi, • • • , V(„^j^j4] where Vi G V. 
We can then clean out any interference. On the other hand, it is guaranteed that ran k[A'iV', A'aV] = 2m* with probabihty 1, 
guaranteeing of decodability of desired signals with probability 1. 



As in Section ITlI-A[ we consider random construction of the code, i.e., each diagonal element in 
each encoding submatrix is i.i.d. uniformly drawn from \ {0}. Then, for a sufficiently large 
field size, the above composite matrix has non-zero determinant with probability 1 (again due 
to Schwartz-Zippel lemma). With this remapping, one can now dualize the relationship between 
systematic and parity node repair. Specifically, if all of the A-'s, B-'s, and C^'s are diagonal 
matrices, then exact repair of the parity nodes becomes transparent, as illustrated in Fig. [6l 
Indeed A^'s, B^'s, and C-'s are diagonal matrices, since these matrices are functions of diagonal 
matrices Aj's, Bj's, and C/s. Therefore, following the same procedure in Section ITlI-Al we can 
guarantee exact repair of parity nodes with probability 1. 

Remark 1 (Connecting to arbitrary d nodes suffice for exact repair): In Section Ull-Al we con- 
sidered the only one connection configuration for exact repair: connecting to A; — 1 systematic 
nodes and d — k + \ parity nodes. We now consider other connection configurations. For example, 
consider the case when node 1 fails, as shown in Fig. [51 Suppose we connect to nodes (2, 4, 5, 6) 
for exact repair of node 1: 1 systematic node and 3 parity nodes. The idea is to remap one parity 
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node to make it look like a systematic node. We then virtually connect to 2 systematic and 2 
parity nodes. Specifically, we can remap node 6 with c'*: 

c'* = a* A3 + b*B3 + c*C3 (18) 

We can then rewrite node 4 and 5 in terms of a, b and c' and therefore we virtually have 
connection with 2 systematic and 2 parity nodes. Note that corresponding encoding submatrices 
after remappring are still diagonal matrices. Hence, we can apply the same procedures as those 
in Section Illl-AI 



C. The MDS-Code Property 

The third part of our framework is to guarantee the MDS-code property. Consider all four 
possibilities corresponding to the Data Collector (DC) contacting (1) 3 systematic nodes; (2) 3 
parity nodes; (3) 1 systematic and 2 parity nodes; (4) 1 systematic and 2 parity nodes. 

The first is a trivial case. The second case has been already verified in the process of forming 
the dual structure. The third case requires the invertibility of all of each encoding submatrix. In 
this case, it is obvious since encoding submatrix is diagonal and each element is non-zero. The 
last case is also easy to check. Consider a specific example where the DC connects to nodes 3, 
4 and 5. In this case, we first recover c from node 3 and subtract the terms associated with c 
from nodes 4 and 5. We then get: 

Ai A2 

(19) 

Bi B2 

Again, using the technique in fi\ and Schwartz-Zippel lemma, for a sufficiently large field size, 
this composite matrix has non-zero determinant with probability 1. 



a* b* 



D. Existence of Codes 

As mentioned several times, for a sufficiently large field size, a random construction for 
encoding submatrices suffices to guarantee exact repair of all nodes and MDS-code property 
with probability 1. Hence, we obtain the following theorem. 

Lemma 1 ((6,3,4) Exact-Repair MDS Codes): There exist vector linear Exact-Repair MDS 
codes that achieve the minimum repair bandwidth corresponding to the cutset bound of ([T]), 
allowing for any failed node to be exactly repaired with access to any arbitrary = 4 survivor 



April 28, 2010 



DRAFT 



17 



nodes, provided storage symbols can be split into a sufficiently large number of subsymbols, 
and the field size can be made sufficiently large. 

IV. Generalization 

As one can easily see, the interference alignment technique described in Section IIII-AI can be 
generalized to all admissible values of (n, k, d), i.e., k < n and k < d < n ~ 1. 

Theorem 1 ( {n, k, d) Exact-Repair MDS Codes): There exist vector linear Exact-Repair MDS 
codes that achieve the minimum repair bandwidth corresponding to the cutset bound of 
allowing for any failed node to be exactly repaired with access to any arbitrary d survivor 
nodes, where k < d < n — 1, provided storage symbols can be split into a sufficiently large 
number of subsymbols, and the field size can be made sufficiently large. 

Proof: In the interests of conceptual simplicity, and to parallel the analysis of the (6,4,3) 
example described earlier, we provide only a sketch of the proof for the general case. This can 
be formalized to be precise at the cost of much heavier notational clutter, which we consciously 
avoid. 

Systematic Node Repair: Let Cj*'* indicate an encoding submatrix for parity node i, associated 
with information unit where 1 < i < n — k and 1 < / < A;. Let w/ be Ith information-unit 
vector. Without loss of generality, consider exact repair of systematic node 1 . Using vector linear 
codes, we split each symbol into B = number of subsymbols, where m is an arbitrarily 
large positive integer and the exponent is given by 

N = {k-l){d-k + l). (20) 

The maximum file size (based on the cutset bound of ([B) is = k(d — k + 1) units, inducing a 
storage cost a = d — k + 1 units. Since each subsymbol has ^ capacity, each storage contains 
am^{= (d — k + l)m^) number of subsymbols. Note that the size of encoding submatrices is 

am^-hy-am'^. 

A failed node 1 is exactly repaired through the following steps. Suppose without loss of 
generality that we connect k — 1 systematic nodes and first d — k + 1 parity nodej^. Parity 

'As mentioned earlier, we can convert the other connection configurations into this particular configuration with the remapping 
technique 
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survivor nodes project their data using the following projection matrix: 

V:=[vl,■••,v„^.]GF7"x-", (21) 
where v, G V. The set V is defined as: 



V:=<' n [Gf^j'''w:e,,zG{l,---,m}}>, (22) 

where w = [1, ■ ■ ■ , 1]*. Note that |V| < . 

Let us consider the equations downloaded from parity nodes: 

w^gS^V) + w* (G«V) + ■ ■ ■ + wt(G«V); 

: (23) 

w\{G^f-'^'W) + w*(G^'-'+'V) + ■ ■ ■ + w* (g1'-'+'V). 

Note that by for / ^ 1, any column vector in [G^^V, ■ • • , aj'^'^^^V] is an element of V 
defined as: 

V:=i n [Gf)]'"'w:e,,,G{l,---,m + l}i, (24) 

Ki=l,--- ,d-k+l,l=2,--- ,k ) 

This implies that for / ^ 1, rank[Gp V, ■ ■ ■ , Q^^-^+^^Y] < (m + 1)^. This allows for simulta- 
neous interference alignment. Systematic survivor nodes project their data using the following 
projection matrix: 

V := [vi, ■ ■ ■ , V(„+i).] G F7"x('-+i)", (25) 
where Vj G V. We can then guarantee that for / 7^ 1: 

span[GpV, ■ • • , gJ-'-'+'V] c span[V]. (26) 

Hence, we can clean out any interference. Now let us consider the decodability of desired signals. 
To successfully recover wi, we need: 

rank[GS' V, ■ ■ ■ , Gi''~'+' V] = {d-k + l)m^ = am^. (27) 

Using the technique in fT| and Schwartz-Zippel lemma fTTl, we can ensure the (fT4l) with 
probability 1 for a sufficiently large field size. 
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Finally we validate that total repair bandwidth is: 



7= (A; - 1)^ ^ + {d-k + V 



(28) 



d. 

Note that as m goes to infinity, total repair bandwidth approaches minimum repair bandwidth 
matching the cutset lower bound of ([I]). 

Parity Node Repair: As discussed in Section ITlI-Bi we can draw a dual structure by remapping 
parity nodes with primed new notations. The key observation is that newly mapped encoding sub- 
matrices are still diagonal matrices. Hence, we can apply the same technique used in systematic 
node repair. 

MDS-Code Property: We check the invertibility of a composite matrix when a Data Collector 
connects to i systematic nodes and k — i parity nodes for i = 0, ■ ■ ■ , A;. As mentioned earlier, for 
a sufficiently large field size, the composite matrix has non-zero determinant with probability 1 . 



V. Conclusion 

Using interference alignment techniques, we have shown the existence of vector linear Exact- 
Repair MDS codes that attain the cutset lower bound on repair bandwidth for all admissible 
values of (ra, k, d). We make use of the interference alignment scheme introduced in the context 
of wireless interference channels in ^ to provide insights into Exact-Repair MDS codes. 
Connecting the two problems allows us to show the existence of vector linear optimal Exact- 
Repair MDS codes in distributed storage systems. 
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